IN THE CLAIMS 

Please amend the claims as follows. 

For the Examiner's convenience, a list of all claims is included below. 

1. (Currently Amended) A machine-implemented method comprising: 

extracting portions from time-domain speech segments, wherein the portions include one
or more pitch periods surrounding a segment boundary [[of]] within a phoneme;

creating feature vectors that represent the portions in a vector space, the feature vectors 
incorporating phase information of the portions, wherein the creating feature vectors comprises 
constructing a matrix W [[from]] containing the portions surrounding the segment boundary within
the phoneme; and decomposing the matrix W containing the portions surrounding the segment
boundary within the phoneme; and

determining a distance between the feature vectors in the vector space. 

2. (Canceled). 

3. (Previously Presented) The machine-implemented method of claim 1, wherein 
decomposing the matrix W comprises extracting global boundary-centric features from the 
portions. 

4. (Currently Amended) The machine-implemented method of claim 1, wherein the speech 
segments each include [[a]] the segment boundary within [[a]] the phoneme.

5. (Original) The machine-implemented method of claim 4, wherein the speech segments 
each include at least one diphone.

6. (Original) The machine-implemented method of claim 5, wherein the portions include at
least one pitch period.

7. (Original) The machine-implemented method of claim 6, wherein decomposing the 
matrix W comprises performing a pitch synchronous singular value analysis on the pitch periods 
of the time-domain segments. 

8. (Previously Presented) The machine-implemented method of claim 6, wherein the matrix 
W is a 2KM x N matrix represented by 

$$W = U \Sigma V^T$$

where $K$ is the number of pitch periods near the segment boundary extracted from each segment,
$N$ is the maximum number of samples among the pitch periods, $M$ is the number of segments in
a voice table having a segment boundary within the phoneme, $U$ is the $2KM \times R$ left singular
matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the $R \times R$ diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix with row vectors $v_j$ ($1 \le j \le N$),
$R \ll 2KM$, and $T$ denotes matrix transposition, wherein decomposing the matrix W comprises
performing a singular value decomposition of W.
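
As an informal illustration only (not part of the claim language), the decomposition recited above might be sketched with NumPy as follows. The function name `decompose`, the toy dimensions, and the random data are assumptions made for the example; `Vt` holds the transpose of the claim's N x R matrix V.

```python
import numpy as np

def decompose(W, R):
    """Truncated SVD of W, whose rows are pitch periods.

    Returns U (rows x R), s (the R largest singular values), and Vt (R x N).
    """
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    return U[:, :R], s[:R], Vt[:R, :]

# Toy data (assumed): 2KM = 6 pitch periods, N = 8 samples each.
rng = np.random.default_rng(0)
W = rng.standard_normal((6, 8))

U, s, Vt = decompose(W, R=3)
W_approx = U @ np.diag(s) @ Vt  # rank-R approximation of W
```

Keeping only R singular values, with R much smaller than 2KM, gives a compact approximation of W; keeping all of them reproduces W exactly.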

9. (Original) The machine-implemented method of claim 8, wherein the pitch periods are 
zero padded to N samples. 
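
As an informal illustration only (not part of the claim language), zero padding the pitch periods to a common length N could look like the sketch below. The function name `build_matrix` and the choice to append the zeros at the end of each period are assumptions for the example.

```python
def build_matrix(pitch_periods):
    """Stack pitch periods as rows, zero padding each to the longest length N."""
    n = max(len(p) for p in pitch_periods)
    # Assumption: padding zeros are appended after the last sample.
    return [list(p) + [0.0] * (n - len(p)) for p in pitch_periods]

rows = build_matrix([[1, 2, 3], [4, 5]])
```

Here N is 3, so the second period receives one trailing zero and every row of the resulting matrix has the same length.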

10. (Original) The machine-implemented method of claim 9, wherein a feature vector $\bar{u}_i$ is
calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a pitch period $i$, and $\Sigma$ is the singular diagonal matrix.



11. (Original) The machine-implemented method of claim 10, wherein the distance between 
two feature vectors is determined by a metric comprising the cosine of the angle between the two 
feature vectors. 



12. (Original) The machine-implemented method of claim 11, wherein the metric comprises
a closeness measure, C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le 2KM$.
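
As an informal illustration only (not part of the claim language), the closeness measure is the cosine of the angle between two rows of U after each is scaled by the singular values. In the sketch below, the function name `closeness` and the representation of the diagonal matrix as a plain list `s` of singular values are assumptions.

```python
import math

def closeness(u_k, u_l, s):
    """Cosine of the angle between u_k*Sigma and u_l*Sigma, with Sigma = diag(s)."""
    a = [x * w for x, w in zip(u_k, s)]  # u_k scaled by the singular values
    b = [x * w for x, w in zip(u_l, s)]  # u_l scaled by the singular values
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

same = closeness([1, 0], [1, 0], [2, 1])
orthogonal = closeness([1, 0], [0, 1], [2, 1])
opposite = closeness([1, 0], [-1, 0], [2, 1])
```

The measure ranges from -1 to 1, with values near 1 indicating pitch periods whose feature vectors point in nearly the same direction.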



13. (Original) The machine-implemented method of claim 12, wherein a difference $d(S_1, S_2)$
between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$, $q_1$ is the
first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, and $\bar{u}_{q_1}$ is a
feature vector associated with pitch period $q_1$.



14. (Original) The machine-implemented method of claim 13, wherein the calculation for
the difference between two segments in the voice table, $S_1$ and $S_2$, is expanded to include a
plurality of pitch periods from each segment.

15. (Original) The machine-implemented method of claim 13, wherein the difference
between two segments in the voice table, $S_1$ and $S_2$, is associated with a discontinuity between $S_1$
and $S_2$.

16. (Original) The machine-implemented method of claim 12, wherein a difference $d(S_1, S_2)$
between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the first
pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is the last pitch
period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, $\bar{u}_{q_1}$
is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature vector associated with pitch
period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch period $\bar{q}_1$.
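
As an informal illustration only (not part of the claim language), the difference recited above compares the distance across a candidate join with the average distance across the two naturally contiguous joins. The sketch below assumes the feature vectors have already been scaled by the singular values and that the distance between pitch periods is one minus the cosine; the names `cosine`, `d0`, and `discontinuity` are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine of the angle between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def d0(a, b):
    """Distance between two pitch periods: one minus the closeness."""
    return 1.0 - cosine(a, b)

def discontinuity(u_p1, u_q1, u_p1_bar, u_q1_bar):
    """|d0(p1,q1) - (d0(p1,p1_bar) + d0(q1,q1_bar)) / 2| on feature vectors."""
    return abs(d0(u_p1, u_q1) - (d0(u_p1, u_p1_bar) + d0(u_q1, u_q1_bar)) / 2)

# Assumed toy vectors: each natural join is seamless, the candidate join is orthogonal.
worst = discontinuity([1, 0], [0, 1], [1, 0], [0, 1])
# Assumed toy vectors: all four pitch periods are identical.
none = discontinuity([1, 1], [1, 1], [1, 1], [1, 1])
```

Subtracting the average natural-join distance means a join is penalized only to the extent that it is worse than the joins the segments had in their original recordings.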

17. (Previously Presented) The machine-implemented method of claim 1, further comprising 
associating the distance between the feature vectors with speech segments in a voice table. 

18. (Original) The machine-implemented method of claim 17, further comprising: 
selecting speech segments from the voice table based on the distance between the feature
vectors.

19. (Original) The machine-implemented method of claim 5, wherein the portions include 
centered pitch periods.

20. (Previously Presented) The machine-implemented method of claim 19, wherein the
matrix W is a $(2(K-1)+1)M \times N$ matrix represented by

$$W = U \Sigma V^T$$

where $K-1$ is the number of centered pitch periods near the segment boundary extracted from
each segment, $N$ is the maximum number of samples among the centered pitch periods, $M$ is the
number of segments in a voice table having a segment boundary within the phoneme, $U$ is the
$(2(K-1)+1)M \times R$ left singular matrix with row vectors $u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the $R \times R$
diagonal matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix
with row vectors $v_j$ ($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and $T$ denotes matrix transposition,
wherein decomposing the matrix W comprises performing a singular value decomposition of W.



21. (Original) The machine-implemented method of claim 20, wherein the centered pitch
periods are symmetrically zero padded to N samples. 
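
As an informal illustration only (not part of the claim language), symmetric zero padding of a centered pitch period might be sketched as below. The function name `pad_symmetric` is hypothetical, and placing the extra zero on the right when the padding is odd is an assumption the claim does not address.

```python
def pad_symmetric(period, n):
    """Zero pad a centered pitch period to n samples, split evenly on both sides."""
    extra = n - len(period)
    if extra < 0:
        raise ValueError("period is longer than the target length")
    left = extra // 2
    # Assumption: when extra is odd, the additional zero goes on the right.
    return [0.0] * left + list(period) + [0.0] * (extra - left)

even = pad_symmetric([1, 2, 3], 7)
odd = pad_symmetric([1, 2], 5)
```

Padding on both sides keeps the center of each pitch period aligned across rows of the matrix.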



22. (Original) The machine-implemented method of claim 21, wherein a feature vector $\bar{u}_i$ is
calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a centered pitch period $i$, and $\Sigma$ is the singular diagonal
matrix.



23. (Original) The machine-implemented method of claim 22, wherein the distance between
two feature vectors is determined by a metric comprising a closeness measure, C, between two
feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le (2(K-1)+1)M$.



24. (Original) The machine-implemented method of claim 23, wherein a difference $d(S_1, S_2)$
between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a feature
vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector associated with a
centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a centered pitch period $\pi_0$, and
$\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch period $\sigma_0$.
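
As an informal illustration only (not part of the claim language), the four-term difference for centered pitch periods could be sketched as below. The sketch assumes the feature vectors are already scaled by the singular values; the names `cosine` and `boundary_difference`, and the spelled-out parameter names standing in for the Greek subscripts, are hypothetical.

```python
import math

def cosine(a, b):
    """Closeness C: cosine of the angle between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def boundary_difference(pi_m1, delta_0, sigma_1, pi_0, sigma_0):
    """C(pi_-1, delta_0) + C(delta_0, sigma_1) - C(pi_-1, pi_0) - C(sigma_0, sigma_1)."""
    return (cosine(pi_m1, delta_0) + cosine(delta_0, sigma_1)
            - cosine(pi_m1, pi_0) - cosine(sigma_0, sigma_1))

# Assumed toy case: delta_0 coincides with both pi_0 and sigma_0,
# so the four closeness terms cancel.
unchanged = boundary_difference([1, 0], [1, 1], [0, 1], [1, 1], [1, 1])
```

The two subtracted terms are closeness values on either side of the boundary, so the expression measures how much closeness changes when the boundary pitch periods are replaced by the period with subscript delta.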

25. (Currently Amended) A machine-readable medium having instructions to cause a 
machine to perform a machine-implemented method comprising: 

extracting portions from time-domain speech segments that include time samples of pitch
periods surrounding a segment boundary [[of]] within a phoneme;

creating feature vectors that represent the portions in a vector space, the feature vectors
incorporating phase information of the portions, wherein the creating feature vectors comprises
constructing a matrix W [[from]] containing the portions surrounding the segment boundary within
the phoneme; and decomposing the matrix W containing the portions surrounding the segment
boundary within the phoneme; and

determining a distance between the feature vectors in the vector space.



26. (Canceled). 



10/693,227 



-7/31- 



4860P3128 



27. (Previously Presented) The machine-readable medium of claim 25, wherein 
decomposing the matrix W comprises extracting global boundary-centric features from the 
portions. 

28. (Currently Amended) The machine-readable medium of claim 25, wherein the speech 
segments each include [[a]] the segment boundary within [[a]] the phoneme. 

29. (Original) The machine-readable medium of claim 28, wherein the speech segments each 
include at least one diphone. 

30. (Original) The machine-readable medium of claim 29, wherein the portions include at 
least one pitch period. 

31. (Original) The machine-readable medium of claim 30, wherein decomposing the matrix
W comprises performing a pitch synchronous singular value analysis on the pitch periods of the 
time-domain segments.

32. (Previously Presented) The machine-readable medium of claim 30, wherein the matrix W
is a $2KM \times N$ matrix represented by

$$W = U \Sigma V^T$$

where $K$ is the number of pitch periods near the segment boundary extracted from each segment,
$N$ is the maximum number of samples among the pitch periods, $M$ is the number of segments in a
voice table having a segment boundary within the phoneme, $U$ is the $2KM \times R$ left singular
matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the $R \times R$ diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix with row vectors $v_j$ ($1 \le j \le N$),
$R \ll 2KM$, and $T$ denotes matrix transposition, wherein decomposing the matrix W comprises
performing a singular value decomposition of W.

33. (Original) The machine-readable medium of claim 32, wherein the pitch periods are zero 
padded to N samples. 

34. (Original) The machine-readable medium of claim 33, wherein a feature vector $\bar{u}_i$ is
calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a pitch period $i$, and $\Sigma$ is the singular diagonal matrix.

35. (Original) The machine-readable medium of claim 34, wherein the distance between two 
feature vectors is determined by a metric comprising the cosine of the angle between the two 
feature vectors. 






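36. (Original) The machine-readable medium of claim 35, wherein the metric comprises a
closeness measure, C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le 2KM$.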



37. (Original) The machine-readable medium of claim 36, wherein a difference $d(S_1, S_2)$
between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$, $q_1$ is the
first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, and $\bar{u}_{q_1}$ is a
feature vector associated with pitch period $q_1$.

38. (Original) The machine-readable medium of claim 37, wherein the calculation for the
difference between two segments in the voice table, $S_1$ and $S_2$, is expanded to include a plurality
of pitch periods from each segment.



39. (Original) The machine-readable medium of claim 37, wherein the difference between
two segments in the voice table, $S_1$ and $S_2$, is associated with a discontinuity between $S_1$ and $S_2$.

40. (Original) The machine-readable medium of claim 36, wherein a difference $d(S_1, S_2)$
between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the first
pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is the last pitch
period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, $\bar{u}_{q_1}$
is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature vector associated with pitch
period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch period $\bar{q}_1$.

41. (Previously Presented) The machine-readable medium of claim 25, wherein the method
further comprises associating the distance between the feature vectors with speech segments in a
voice table.

42. (Original) The machine-readable medium of claim 41, wherein the method further
comprises:

selecting speech segments from the voice table based on the distance between the feature
vectors.

43. (Original) The machine-readable medium of claim 29, wherein the portions include 
centered pitch periods.

44. (Previously Presented) The machine-readable medium of claim 43, wherein the matrix W
is a $(2(K-1)+1)M \times N$ matrix represented by

$$W = U \Sigma V^T$$

where $K-1$ is the number of centered pitch periods near the segment boundary extracted from
each segment, $N$ is the maximum number of samples among the centered pitch periods, $M$ is the
number of segments in a voice table having a segment boundary within the phoneme, $U$ is the
$(2(K-1)+1)M \times R$ left singular matrix with row vectors $u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the $R \times R$
diagonal matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix
with row vectors $v_j$ ($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and $T$ denotes matrix transposition,
wherein decomposing the matrix W comprises performing a singular value decomposition of W.

45. (Original) The machine-readable medium of claim 44, wherein the centered pitch 
periods are symmetrically zero padded to N samples. 



46. (Original) The machine-readable medium of claim 45, wherein a feature vector $\bar{u}_i$ is
calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a centered pitch period $i$, and $\Sigma$ is the singular diagonal
matrix.



47. (Original) The machine-readable medium of claim 46, wherein the distance between two
feature vectors is determined by a metric comprising a closeness measure, C, between two
feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le (2(K-1)+1)M$.



48. (Original) The machine-readable medium of claim 47, wherein a difference $d(S_1, S_2)$
between two segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a feature
vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector associated with a
centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a centered pitch period $\pi_0$, and
$\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch period $\sigma_0$.

49. (Currently Amended) An apparatus comprising: 

means for extracting portions from time-domain speech segments, wherein the portions
include one or more pitch periods surrounding a segment boundary [[of]] within a phoneme;

means for creating feature vectors that represent the portions in a vector space, the feature
vectors incorporating phase information of the portions, wherein means for creating feature
vectors comprises means for constructing a matrix W [[from]] containing the portions surrounding
the segment boundary within the phoneme; and means for decomposing the matrix W containing
the portions surrounding the segment boundary within the phoneme; and

means for determining a distance between the feature vectors in the vector space. 

50. (Canceled).

51. (Previously Presented) The apparatus of claim 49, wherein the means for decomposing
the matrix W comprises means for extracting global boundary-centric features from the portions.

52. (Currently Amended) The apparatus of claim 49, wherein the speech segments each 
include [[a]] the segment boundary within [[a]] the phoneme. 

53. (Original) The apparatus of claim 52, wherein the speech segments each include at least 
one diphone. 

54. (Original) The apparatus of claim 53, wherein the portions include at least one pitch 
period. 

55. (Original) The apparatus of claim 54, wherein the means for decomposing the matrix W 
comprises means for performing a pitch synchronous singular value analysis on the pitch periods 
of the time-domain segments.

56. (Previously Presented) The apparatus of claim 54, wherein the matrix W is a $2KM \times N$
matrix represented by

$$W = U \Sigma V^T$$

where $K$ is the number of pitch periods near the segment boundary extracted from each segment,
$N$ is the maximum number of samples among the pitch periods, $M$ is the number of segments in a
voice table having a segment boundary within the phoneme, $U$ is the $2KM \times R$ left singular
matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the $R \times R$ diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix with row vectors $v_j$ ($1 \le j \le N$),
$R \ll 2KM$, and $T$ denotes matrix transposition, wherein decomposing the matrix W comprises
performing a singular value decomposition of W.

57. (Original) The apparatus of claim 56, wherein the pitch periods are zero padded to N
samples.

58. (Original) The apparatus of claim 57, wherein a feature vector $\bar{u}_i$ is calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a pitch period $i$, and $\Sigma$ is the singular diagonal matrix.



59. (Original) The apparatus of claim 58, wherein the distance between two feature vectors 
is determined by a metric comprising the cosine of the angle between the two feature vectors. 



60. (Original) The apparatus of claim 59, wherein the metric comprises a closeness measure,
C, between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le 2KM$.



61. (Original) The apparatus of claim 60, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$, $q_1$ is the
first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, and $\bar{u}_{q_1}$ is a
feature vector associated with pitch period $q_1$.

62. (Original) The apparatus of claim 61, wherein the calculation for the difference between
two segments in the voice table, $S_1$ and $S_2$, is expanded to include a plurality of pitch periods
from each segment.

63. (Original) The apparatus of claim 61, wherein the difference between two segments in
the voice table, $S_1$ and $S_2$, is associated with a discontinuity between $S_1$ and $S_2$.

64. (Original) The apparatus of claim 60, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the first
pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is the last pitch
period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, $\bar{u}_{q_1}$
is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature vector associated with pitch
period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch period $\bar{q}_1$.

65. (Previously Presented) The apparatus of claim 49, further comprising means for
associating the distance between the feature vectors with speech segments in a voice table.

66. (Original) The apparatus of claim 65, further comprising: 

means for selecting speech segments from the voice table based on the distance between 
the feature vectors. 

67. (Original) The apparatus of claim 53, wherein the portions include centered pitch 
periods. 

68. (Previously Presented) The apparatus of claim 67, wherein the matrix W is a 
$(2(K-1)+1)M \times N$ matrix represented by

$$W = U \Sigma V^T$$

where $K-1$ is the number of centered pitch periods near the segment boundary extracted from
each segment, $N$ is the maximum number of samples among the centered pitch periods, $M$ is the
number of segments in a voice table having a segment boundary within the phoneme, $U$ is the
$(2(K-1)+1)M \times R$ left singular matrix with row vectors $u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the $R \times R$
diagonal matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix
with row vectors $v_j$ ($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and $T$ denotes matrix transposition,
wherein decomposing the matrix W comprises performing a singular value decomposition of W.

69. (Original) The apparatus of claim 68, wherein the centered pitch periods are 
symmetrically zero padded to N samples.

70. (Original) The apparatus of claim 69, wherein a feature vector $\bar{u}_i$ is calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a centered pitch period $i$, and $\Sigma$ is the singular diagonal
matrix.

71. (Original) The apparatus of claim 70, wherein the distance between two feature vectors
is determined by a metric comprising a closeness measure, C, between two feature vectors, $\bar{u}_k$
and $\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le (2(K-1)+1)M$.

72. (Original) The apparatus of claim 71, wherein a difference $d(S_1, S_2)$ between two
segments in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a feature
vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector associated with a
centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a centered pitch period $\pi_0$, and
$\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch period $\sigma_0$.

73. (Currently Amended) A system comprising: 

a processing unit coupled to a memory through a bus; and
wherein the processing unit is configured, for a process, to extract portions from time-domain
speech segments, wherein the portions include one or more pitch periods surrounding a segment
boundary [[of]] within a phoneme, create feature vectors that represent the portions in a vector
space, wherein the processing unit is configured, when creating feature vectors, to construct a
matrix W [[from]] containing the portions surrounding the segment boundary within the phoneme,
and decompose the matrix W containing the portions surrounding the segment boundary within
the phoneme, the feature vectors incorporating phase information of the portions, and determine
a distance between the feature vectors in the vector space.

74. (Canceled). 

75. (Previously Presented) The system of claim 73, wherein the process further causes the 
processing unit, when decomposing the matrix W, to extract global boundary-centric features 
from the portions. 

76. (Currently Amended) The system of claim 73, wherein the speech segments each include 
[[a]] the segment boundary within [[a]] the phoneme. 

77. (Original) The system of claim 76, wherein the speech segments each include at least 
one diphone. 

78. (Original) The system of claim 77, wherein the portions include at least one pitch period.

79. (Original) The system of claim 78, wherein the process further causes the processing
unit, when decomposing the matrix W, to perform a pitch synchronous singular value analysis on
the pitch periods of the time-domain segments.

80. (Previously Presented) The system of claim 78, wherein the matrix W is a 2KM x N 
matrix represented by 

$$W = U \Sigma V^T$$

where $K$ is the number of pitch periods near the segment boundary extracted from each segment,
$N$ is the maximum number of samples among the pitch periods, $M$ is the number of segments in a
voice table having a segment boundary within the phoneme, $U$ is the $2KM \times R$ left singular
matrix with row vectors $u_i$ ($1 \le i \le 2KM$), $\Sigma$ is the $R \times R$ diagonal matrix of singular values
$s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix with row vectors $v_j$ ($1 \le j \le N$),
$R \ll 2KM$, and $T$ denotes matrix transposition, wherein decomposing the matrix W comprises
performing a singular value decomposition of W.

81. (Original) The system of claim 80, wherein the pitch periods are zero padded to N
samples.

82. (Original) The system of claim 81, wherein a feature vector $\bar{u}_i$ is calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a pitch period $i$, and $\Sigma$ is the singular diagonal matrix.

83. (Original) The system of claim 82, wherein the distance between two feature vectors is 
determined by a metric comprising the cosine of the angle between the two feature vectors.

84. (Original) The system of claim 83, wherein the metric comprises a closeness measure, C,
between two feature vectors, $\bar{u}_k$ and $\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le 2KM$.



85. (Original) The system of claim 84, wherein a difference $d(S_1, S_2)$ between two segments
in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = d_0(p_1, q_1) = 1 - C(\bar{u}_{p_1}, \bar{u}_{q_1})$$

where $d_0$ is the distance between pitch periods $p_1$ and $q_1$, $p_1$ is the last pitch period of $S_1$, $q_1$ is the
first pitch period of $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, and $\bar{u}_{q_1}$ is a
feature vector associated with pitch period $q_1$.



86. (Original) The system of claim 85, wherein the calculation for the difference between
two segments in the voice table, $S_1$ and $S_2$, is expanded to include a plurality of pitch periods
from each segment.



87. (Original) The system of claim 85, wherein the difference between two segments in the
voice table, $S_1$ and $S_2$, is associated with a discontinuity between $S_1$ and $S_2$.

88. (Original) The system of claim 84, wherein a difference $d(S_1, S_2)$ between two segments
in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = \left| d_0(p_1, q_1) - \frac{d_0(p_1, \bar{p}_1) + d_0(q_1, \bar{q}_1)}{2} \right| = \left| \frac{C(\bar{u}_{p_1}, \bar{u}_{\bar{p}_1}) + C(\bar{u}_{q_1}, \bar{u}_{\bar{q}_1})}{2} - C(\bar{u}_{p_1}, \bar{u}_{q_1}) \right|$$

where $d_0$ is the distance between pitch periods, $p_1$ is the last pitch period of $S_1$, $\bar{p}_1$ is the first
pitch period of a segment contiguous to $S_1$, $q_1$ is the first pitch period of $S_2$, $\bar{q}_1$ is the last pitch
period of a segment contiguous to $S_2$, $\bar{u}_{p_1}$ is a feature vector associated with pitch period $p_1$, $\bar{u}_{q_1}$
is a feature vector associated with pitch period $q_1$, $\bar{u}_{\bar{p}_1}$ is a feature vector associated with pitch
period $\bar{p}_1$, and $\bar{u}_{\bar{q}_1}$ is a feature vector associated with pitch period $\bar{q}_1$.

89. (Previously Presented) The system of claim 74, wherein the process further causes the 
processing unit to associate the distance between the feature vectors with speech segments in a 
voice table. 

90. (Original) The system of claim 89, wherein the process further causes the processing unit 
to select speech segments from the voice table based on the distance between the feature vectors. 

91. (Original) The system of claim 77, wherein the portions include centered pitch periods. 

92. (Previously Presented) The system of claim 91, wherein the matrix W is a $(2(K-1)+1)M \times N$
matrix represented by

$$W = U \Sigma V^T$$

where $K-1$ is the number of centered pitch periods near the segment boundary extracted from
each segment, $N$ is the maximum number of samples among the centered pitch periods, $M$ is the
number of segments in a voice table having a segment boundary within the phoneme, $U$ is the
$(2(K-1)+1)M \times R$ left singular matrix with row vectors $u_i$ ($1 \le i \le (2(K-1)+1)M$), $\Sigma$ is the $R \times R$
diagonal matrix of singular values $s_1 \ge s_2 \ge \dots \ge s_R > 0$, $V$ is the $N \times R$ right singular matrix
with row vectors $v_j$ ($1 \le j \le N$), $R \ll (2(K-1)+1)M$, and $T$ denotes matrix transposition,
wherein decomposing the matrix W comprises performing a singular value decomposition of W.

93. (Original) The system of claim 92, wherein the centered pitch periods are symmetrically 
zero padded to N samples. 

94. (Original) The system of claim 93, wherein a feature vector $\bar{u}_i$ is calculated as

$$\bar{u}_i = u_i \Sigma$$

where $u_i$ is a row vector associated with a centered pitch period $i$, and $\Sigma$ is the singular diagonal
matrix.



95. (Original) The system of claim 94, wherein the distance between two feature vectors is
determined by a metric comprising a closeness measure, C, between two feature vectors, $\bar{u}_k$ and
$\bar{u}_l$, wherein C is calculated as

$$C(\bar{u}_k, \bar{u}_l) = \cos(u_k \Sigma, u_l \Sigma) = \frac{u_k \Sigma^2 u_l^T}{\|u_k \Sigma\| \, \|u_l \Sigma\|}$$

for any $1 \le k, l \le (2(K-1)+1)M$.

96. (Original) The system of claim 95, wherein a difference $d(S_1, S_2)$ between two segments
in the voice table, $S_1$ and $S_2$, is calculated as

$$d(S_1, S_2) = C(\bar{u}_{\pi_{-1}}, \bar{u}_{\delta_0}) + C(\bar{u}_{\delta_0}, \bar{u}_{\sigma_1}) - C(\bar{u}_{\pi_{-1}}, \bar{u}_{\pi_0}) - C(\bar{u}_{\sigma_0}, \bar{u}_{\sigma_1})$$

where $\bar{u}_{\pi_{-1}}$ is a feature vector associated with a centered pitch period $\pi_{-1}$, $\bar{u}_{\delta_0}$ is a feature
vector associated with a centered pitch period $\delta_0$, $\bar{u}_{\sigma_1}$ is a feature vector associated with a
centered pitch period $\sigma_1$, $\bar{u}_{\pi_0}$ is a feature vector associated with a centered pitch period $\pi_0$, and
$\bar{u}_{\sigma_0}$ is a feature vector associated with a centered pitch period $\sigma_0$.

97. (Currently Amended) A machine-implemented method comprising: 

gathering time-domain samples from recorded speech segments, wherein the time-domain
samples include time samples of pitch periods surrounding a segment boundary [[of]] within
a phoneme;

extracting features that represent the time-domain samples, wherein the extracting
features comprises constructing a matrix [[from]] containing the time-domain samples of the pitch
periods surrounding the segment boundary within the phoneme and decomposing the matrix
containing the time domain samples of the pitch periods surrounding the segment boundary
within the phoneme;

determining a discontinuity between the segments, the discontinuity based on a distance 
between the features. 

98. (Canceled). 

99. (Previously Presented) The machine-implemented method of claim 97, wherein the 
features incorporate phase information of the pitch periods. 

100. (Canceled).

101. (Currently Amended) A machine-readable medium having instructions to cause a
machine to perform a machine-implemented method comprising:

gathering time-domain samples from recorded speech segments, wherein the time-domain
samples include time samples of pitch periods surrounding a segment boundary [[of]] within
a phoneme;

extracting features that represent the time-domain samples, wherein the extracting
features comprises constructing a matrix [[from]] containing the time-domain samples of the pitch
periods surrounding the segment boundary within the phoneme and decomposing the matrix
containing the time domain samples of the pitch periods surrounding the segment boundary
within the phoneme;

determining a discontinuity between the segments, the discontinuity based on a distance 
between the features. 

102. (Canceled). 

103. (Previously Presented) The machine-readable medium of claim 101, wherein the features 
incorporate phase information of the pitch periods. 

104. (Canceled). 

105. (Currently Amended) An apparatus comprising: 

means for gathering time-domain samples from recorded speech segments, wherein the
time-domain samples include time samples of pitch periods surrounding a segment boundary [[of]]
within a phoneme;

means for extracting features that represent the time-domain samples, wherein the means
for extracting features comprises means for constructing a matrix [[from]] containing the
time-domain samples of the pitch periods surrounding the segment boundary within the phoneme and
decomposing the matrix containing the time domain samples of the pitch periods surrounding the
segment boundary within the phoneme;

means for determining a discontinuity between the segments, the discontinuity based on a 
distance between the features. 

106. (Canceled). 

107. (Previously Presented) The apparatus of claim 105, wherein the features incorporate 
phase information of the pitch periods. 

108. (Canceled). 

109. (Currently Amended) A system comprising: 

a processing unit coupled to a memory through a bus; and
a process executed from the memory by the processing unit to cause the processing unit to gather
time-domain samples from recorded speech segments, wherein the time-domain samples include
time samples of pitch periods surrounding a segment boundary [[of]] within a phoneme, extract
features that represent the time-domain samples, wherein the extracting features comprises
constructing a matrix [[from]] containing the time-domain samples of the pitch periods surrounding
the segment boundary within the phoneme and decomposing the matrix containing the time
domain samples of the pitch periods surrounding the segment boundary within the phoneme, and
determine a discontinuity between the segments, the discontinuity based on a distance between
the features.

110. (Canceled). 

111. (Previously Presented) The system of claim 109, wherein the features incorporate phase 
information of the pitch periods. 

112. (Canceled).